Search for: All records

Creators/Authors contains: "Wojtczak, D."

« Prev Next »

Total Resources

2

Resource Type
Conference Paper

2

Conference Proceeding

0

Dataset

0

Journal Article

0

Workshop Report

0

Availability
Full Text / Resource Available

2

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Model-Free Reinforcement Learning for Lexicographic Omega-Regular Objectives

https://doi.org/10.1007/978-3-030-90870-6_8

Hahn, E.M. ; Perez, M. ; Schewe, S. ; Somenzi, F. ; Trivedi, A. ; Wojtczak, D. ( November 2021 , Formal Methods (FM 2021))
Huisman, M. ; Păsăreanu, C. ; Zhan, N. (Ed.)
We study the problem of finding optimal strategies in Markov decision processes with lexicographic ω-regular objectives, which are ordered collections of ordinary ω-regular objectives. The goal is to compute strategies that maximise the probability of satisfaction of the first 𝜔-regular objective; subject to that, the strategy should also maximise the probability of satisfaction of the second ω-regular objective; then the third and so forth. For instance, one may want to guarantee critical requirements first, functional ones second and only then focus on the non-functional ones. We show how to harness the classic off-the-shelf model-free reinforcement learning techniques to solve this problem and evaluate their performance on four case studies.
more » « less
Full Text Available
Model-Free Reinforcement Learning for Branching Markov Decision Processes

https://doi.org/10.1007/978-3-030-81688-9_30

Hahn, E.M. ; Perez, M. ; Schewe, S. ; Somenzi, F. ; Trivedi, A. ; Wojtczak, D. ( July 2021 , Computer Aided Verification. CAV 2021.)
Silva, A. ; Leino, K.R.M. (Ed.)
We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDPs), a natural extension of (multitype) Branching Markov Chains (BMCs). The state of a (discrete-time) BMCs is a collection of entities of various types that, while spawning other entities, generate a payoff. In comparison with BMCs, where the evolution of a each entity of the same type follows the same probabilistic pattern, BMDPs allow an external controller to pick from a range of options. This permits us to study the best/worst behaviour of the system. We generalise model-free reinforcement learning techniques to compute an optimal control strategy of an unknown BMDP in the limit. We present results of an implementation that demonstrate the practicality of the approach.
more » « less
Full Text Available